Selecting provisioning targets for new virtual machine instances

ABSTRACT

One embodiment of a method for provisioning a new virtual machine instance based on the content of an image of the new virtual machine instance includes identifying, from among a plurality of host machines, the host machine having the highest percentage of the content available in local storage, and provisioning the new virtual machine instance on the host machine having the highest percentage of the content available in local storage. Another embodiment of a method for provisioning a new virtual machine instance based on an image of the new virtual machine instance includes constructing at least a portion of the image using data stored locally on a target machine hosting the new virtual machine instance, and completing the image using data obtained over a network from remote storage.

BACKGROUND OF THE INVENTION

The present invention relates generally to cloud computing and relates more specifically to the provisioning of virtual machines in the cloud.

A virtual machine is a software implementation of a machine (e.g., a computer) that executes programs like a physical machine. When a new virtual machine instance is to be provisioned in the cloud containing multiple hypervisor host machines, one must determine which of the host machines is best suited to host the new instance.

Typical placement algorithms identify the best suited host machine based on resource availability at the host machine (e.g., central processing unit, disk, bandwidth, and/or memory availability). For instance, a placement algorithm may divide each host machine into a fixed number of “slots” (i.e., a certain number of cores and memories) and allocate virtual machine instances to free slots (e.g., based on round robin, lowest slot number first, or other allocation schemes).

Once a target host machine is selected, the virtual machine instance is provisioned by first copying the virtual machine image from a storage server to the target host machine. This process consumes network and storage server bandwidth and adds latency to the provisioning process. Notably, virtual machine provisioning time is a key metric of cloud elasticity (i.e., ability to handle sudden, unanticipated, and extraordinary loads), and cost minimization is closely tied to resource usage.

SUMMARY OF THE INVENTION

One embodiment of a method for provisioning a new virtual machine instance based on the content of an image of the new virtual machine instance includes identifying, from among a plurality of host machines, the host machine having the highest percentage of the content available in local storage, and provisioning the new virtual machine instance on the host machine having the highest percentage of the content available in local storage.

Another embodiment of a method for provisioning a new virtual machine instance based on an image of the new virtual machine instance includes constructing at least a portion of the image using data stored locally on a target machine hosting the new virtual machine instance, and completing the image using data obtained over a network from remote storage.

BRIEF DESCRIPTION OF THE DRAWINGS

So that the manner in which the above recited features of the present invention can be understood in detail, a more particular description of the invention may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only typical embodiments of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.

FIG. 1 is a block diagram illustrating one embodiment of a system for selecting provisioning targets for new virtual machine instances, according to the present invention;

FIG. 2 is a block diagram illustrating an exemplary embodiment of the hypervisor of FIG. 1 in greater detail;

FIG. 3 illustrates an exemplary similarity matrix;

FIG. 4 is a flow diagram illustrating one embodiment of a method for selecting a provisioning target for a new virtual machine instance, according to embodiments of the present invention;

FIG. 5 is a flow diagram illustrating one embodiment of a method for provisioning a new virtual machine instance, according to embodiments of the present invention and

FIG. 6 is a high-level block diagram of the provisioning method that is implemented using a general purpose computing device.

DETAILED DESCRIPTION

In one embodiment, the invention is a method and apparatus for selecting provisioning targets for new virtual machine instances. Embodiments of the invention construct the required image for a new virtual machine instance using a set of local images already stored on the target host machine (i.e., for virtual machine instances already running on the target host machine). This reduces the amount of data that must be copied over the network, since only the portions of the required image that are not already present locally on the target host machine need to be copied. In turn, the amount of time and resources required to provision the new virtual machine instance may be substantially reduced. Thus, a target host machine is selected based at least in part on the similarity between the image of the new virtual machine instance and the images of the virtual machine instances already running on the host machine.

FIG. 1 is a block diagram illustrating one embodiment of a system 100 for selecting provisioning targets for new virtual machine instances, according to the present invention. The system 100 is configured to receive requests for the provisioning of new virtual machine instances and to provision those requests by selecting a target host machine (e.g., hypervisor) that is best suited to host the new virtual machine instance based on image similarity. To this end, the system 100 generally comprises a provisioning manager 102, a plurality of hypervisors 104 (hereinafter collectively referred to as “hypervisors 104”), and a storage server 106.

The provisioning manager 102 comprises a processor that receives and allocates provisioning requests. The provisioning manager 102 is in communication with the hypervisors 104 or with agents deployed on the hypervisors 104.

The hypervisors 104 comprise virtual machine managers that allow guest operating systems to be hosted and managed on host computers. For instance, one or more of the hypervisors 104 may be installed on a server. Taking the hypervisor 104 ₁ as an example, each hypervisor 104 includes a plurality of slots 108 ₁-108 _(m) (hereinafter collectively referred to as “slots 108”) and a direct attached storage 110. As discussed above, each of the slots 108 comprises a certain number of cores and memories to be allocated by the hypervisor 104 ₁ to virtual machine instances. The direct attached storage 110 contains virtual machine images (i.e., files containing the complete contents and structures representing virtual machine instances) of virtual machine instances currently running on the hypervisor 104 ₁. Additional details of the hypervisors 104 are illustrated in FIG. 2.

The hypervisors 104 are further in communication with the storage server 106, which includes an image library. The image library includes virtual machine images for a plurality of virtual machine instances, including the virtual machine instances currently running on each of the hypervisors 104.

FIG. 2 is a block diagram illustrating an exemplary embodiment of the hypervisor 104 ₁ of FIG. 1 in greater detail. As discussed above, the hypervisor 1041 includes a plurality of slots 108 and a direct attached storage 110. In addition, the hypervisor 104 ₁ includes a virtual machine creator 112 and a similarity matrix 114.

The virtual machine creator 112 is an agent (e.g., a software agent or a processor) that communicates with the provisioning manager 102 in order to determine whether the hypervisor 104 ₁ is best suited to host a new virtual machine instance. The virtual machine creator 112 is also in communication with the storage server 106, the direct attached storage 110, and the similarity matrix 114. The virtual machine creator 112 tracks the images for virtual machine instances running on the hypervisor 104 ₁ and constructs the similarity matrix 114. If the hypervisor 104 ₁ is selected for the provisioning of the new virtual machine instance, the virtual machine creator 112 cooperates with the provisioning manager 102 to establish the new virtual machine instance.

The similarity matrix 114 tracks information regarding the image types that are available locally to the hypervisor 104 ₁, so that a comparison can be made to the image of a new virtual machine instance, as discussed in greater detail below. The virtual machine creator 112 may update the similarity matrix 114 (e.g. , periodically, on demand, or in response to a change in the direct attached storage 110).

FIG. 3 illustrates an exemplary similarity matrix 114. As illustrated, the similarity matrix 114 stores data about images and clusters stored on a hypervisor. As used herein, a “cluster” refers to a portion of an image; an image is thus made up of one or more clusters. Clusters may vary in size. Moreover, the same cluster may appear in multiple different images.

For instance, the similarity matrix 114 includes a column that lists a plurality of exemplary clusters according to their cluster identifiers (Cl01-CL-20). Along the row for each cluster identifier, the images containing the associated cluster are identified by their image type identifiers (1-10). A zero at the intersection of a cluster identifier and an image type identifier indicates that the corresponding image does not contain the corresponding cluster (e.g., image 1 does not contain cluster CL-02, among others); a one at the intersection of a cluster identifier and an image type identifier indicates that the corresponding image does contain the corresponding cluster (e.g., image 1 does contain cluster CL-01, among others).

In FIG. 3, clusters CL-01-CL-10 are referred to as “singletons.” Singleton clusters are clusters that occur in only a single image. By contrast, clusters CL-11-CL-20 each occur in multiple images.

FIG. 4 is a flow diagram illustrating one embodiment of a method 400 for selecting a provisioning target for a new virtual machine instance, according to embodiments of the present invention. In one embodiment, the method 400 may be performed by the provisioning manager 102 or a general purpose computing device as illustrated in FIG. 1 and discussed below.

The method 400 begins in step 402. In step 404, the provisioning manager 102 receives a request to provision a new virtual machine instance.

In step 406, the provisioning manager 102 sends a message to the virtual machine creators 112 of each hypervisor 104 to inquire which of the hypervisors 104 have empty slots that can accommodate the new virtual machine instance. In step 408, the provisioning manager 102 receives a plurality of responses from the virtual machine creators 112 indicating which of the hypervisors 104 have empty slots.

In step 410, the provisioning manager 102 selects a hypervisor 104 having an empty slot. In step 412, the provisioning manager 102 sends a message to the virtual machine creator 112 of the selected hypervisor 104 requesting the percentage of the required image (i.e., the image required for the new virtual machine instance) that is available locally at the selected hypervisor 104 (e.g., via the direct attached storage 110). In one embodiment, this percentage is based on the number of clusters occurring in the required image that are shared by images of virtual machine instances already running on the selected hypervisor 104.

In step 414, the provisioning manager 102 receives a response from the virtual machine creator 112 containing the percentage of the required image that is available locally at the selected hypervisor 104. The provisioning manager 102 then determines, in step 416, whether there are any additional hypervisors 104 having empty slots that have not yet been contacted to determine what percentage of the required image they store locally.

If the provisioning manager 102 concludes in step 416 that there are additional hypervisors 104 having empty slots that have not yet been contacted, then the method 400 returns to step 410 and proceeds as described above with the provisioning manager selecting a next hypervisor 104 having an empty slot.

Alternatively, if the provisioning manager 102 concludes in step 416 that all of the hypervisors 104 having empty slots have been contacted, then the method 400 proceeds to step 418, and the provisioning manager 102 identifies the hypervisor 104 having the highest percentage of the required image available locally. The provisioning manager then provisions the new virtual machine instance on the identified hypervisor in step 420. The method 400 ends in step 422.

FIG. 5 is a flow diagram illustrating one embodiment of a method 500 for provisioning a new virtual machine instance, according to embodiments of the present invention. In one embodiment, the method 500 may be performed by the virtual machine creator 112 of a hypervisor 104 or a general purpose computing device as illustrated in FIG. 2 and discussed below.

The method 500 begins in step 502. In step 504, the virtual machine creator 112 receives a message from the provisioning manager 102 inquiring whether the hypervisor 104 has any empty slots that can accommodate a new virtual machine instance. In step 506, the virtual machine creator 112 determines whether the hypervisor 104 has any empty slots.

If the virtual machine creator 112 concludes in step 506 that the hypervisor 104 does not have an empty slot, then the virtual machine creator 112 sends a negative response to the provisioning manager 102 in step 510. The method 500 then ends in step 522.

Alternatively, if the virtual machine creator 112 concludes in step 506 that the hypervisor 104 does have an empty slot, then the virtual machine creator 112 sends an affirmative response to the provisioning manager 102 in step 508.

In step 512, the virtual machine creator 112 receives a message from the provisioning manager 102 requesting the percentage of the required image (i.e., the image required for the new virtual machine instance) that is available locally at the hypervisor 104 (e.g., via the direct attached storage 110).

In step 514, the virtual machine creator 112 computes the percentage of the required image that is available locally at the hypervisor 104. As discussed above, in one embodiment, this percentage is based on the number of clusters occurring in the required image that are shared by images of virtual machine instances already running on the hypervisor 104. In one embodiment, the virtual machine creator 112 consults the similarity matrix 114 for the data necessary to compute the percentage. In step 516, the virtual machine creator 112 sends a response to the provisioning manager 102 including the computed percentage.

In optional step 518 (illustrated in phantom in FIG. 5), the virtual machine creator 112 receives a message from the provisioning manager 102 requesting that the new virtual machine instance be provisioned on the hypervisor 104. The virtual machine creator 112 then provisions the new virtual machine instance on the hypervisor 104 in optional step 520 (illustrated in phantom in FIG. 5). In one embodiment, provisioning the new virtual machine instance includes using images or clusters that are available locally on the hypervisor (e.g., in the direct attached storage 110). In a further embodiment, the locally available images or clusters provide only a portion of the required image, and any images or clusters that are not available locally are obtained from remote storage (e.g., the storage server 106) to complete the required image. The method 500 ends in step 522.

The invention disclosed herein thus minimizes provisioning time and resource usage by selecting target host machines based at least in part on image redundancy. By constructing the required image for the new virtual machine using as much locally stored data as possible, the amount of data that must be copied over the network can be significantly reduced.

FIG. 6 is a high-level block diagram of the provisioning method that is implemented using a general purpose computing device 600. In one embodiment, a general purpose computing device 300 comprises a processor 602, a memory 604, a provisioning module 605 and various input/output (I/O) devices 606 such as a display, a keyboard, a mouse, a stylus, a wireless network access card, an Ethernet interface, and the like. In one embodiment, at least one I/O device is a storage device (e.g., a disk drive, an optical disk drive, a floppy disk drive). It should be understood that the provisioning module 605 can be implemented as a physical device or subsystem that is coupled to a processor through a communication channel.

Alternatively, the provisioning module 605 can be represented by one or more software applications (or even a combination of software and hardware, e.g., using Application Specific Integrated Circuits (ASIC)), where the software is loaded from a storage medium (e.g., I/O devices 606) and operated by the processor 602 in the memory 604 of the general purpose computing device 600. Thus, in one embodiment, the provisioning module 605 for provisioning new virtual machine instances, as described herein with reference to the preceding figures, can be stored on a tangible or physical computer readable storage medium (e.g., RAM, magnetic or optical drive or diskette, and the like).

It should be noted that although not explicitly specified, one or more steps of the methods described herein may include a storing, displaying and/or outputting step as required for a particular application. In other words, any data, records, fields, and/or intermediate results discussed in the methods can be stored, displayed, and/or outputted to another device as required for a particular application. Furthermore, steps or blocks in the accompanying figures that recite a determining operation or involve a decision, do not necessarily require that both branches of the determining operation be practiced. In other words, one of the branches of the determining operation can be deemed as an optional step.

While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof. Various embodiments presented herein, or portions thereof, may be combined to create further embodiments. Furthermore, terms such as top, side, bottom, front, back, and the like are relative or positional terms and are used with respect to the exemplary embodiments illustrated in the figures, and as such these terms may be interchangeable. 

What is claimed is:
 1. A method for provisioning a new virtual machine instance, the method comprising: sending, by a processor, a message to each of a plurality of host machines, the message including an inquiry into whether the each of the plurality of host machines has an empty slot to host the new virtual machine instance, wherein the empty slot comprises a number of cores and a number of memories to be allocated to the new virtual machine instance; receiving, by the processor, a response message from each of the plurality of host machines, the response message indicating if the each of the plurality of host machines has at least one empty slot to host the new virtual machine instance; identifying, by the processor, a subset of the plurality of host machines, wherein each host machine in the subset of the plurality of host machines has at least one empty slot available to host the new virtual machine instance; sending, by the processor, a second message to each respective host in the subset of the plurality of host machines, the second message including an inquiry into a percentage of an image of the new virtual machine instance available in local storage on the respective host, wherein the image comprises a plurality of clusters; calculating, by each respective host and in response to receiving the second message, the percentage of the image of the new virtual machine instance available in local storage on the respective host, the calculating utilizing a consultation to a similarity matrix comprising: a plurality of rows, each of the plurality of rows corresponding to a given cluster of the plurality of clusters; a plurality of columns, each of the plurality of columns corresponding to a given image of a plurality of images; and at each intersection of one of the plurality of rows and one of the plurality of columns, an indicator indicating whether the given cluster occurs in the given image; receiving, by the processor, a second response message from each respective host in the subset of the plurality of host machines, the second response message indicating the percentage of the image of the new virtual machine instance available in local storage on the respective host; identifying, by the processor, from among the subset of the plurality of host machines, a host machine having a highest percentage of an image of the new virtual machine instance available in local storage; and provisioning, by the processor, the new virtual machine instance on the host machine having the highest percentage of the image available in local storage, wherein the provisioning comprises instructing the host machine having the highest percentage of the image available in local storage to run the new virtual machine instance.
 2. The method of claim 1, wherein the image comprises at least one cluster.
 3. The method of claim 2, wherein the identifying comprises: determining, for each of the plurality of host machines, a number of clusters of the at least one cluster that are available in local storage.
 4. The method of claim 3, wherein the determining comprises: sending a message to an agent running on each of the plurality of host machines, the message requesting that the agent compute a number of clusters of the at least one cluster that are available in local storage; and receiving a response from the agent, the response including an indication of the number of clusters of the at least one cluster that are available in local storage.
 5. The method of claim 1, wherein the provisioning comprises: directing the host machine having the highest percentage of the image available in local storage to construct at least a portion of the image using locally stored data.
 6. The method of claim 5, wherein the locally stored data comprises at least one cluster stored on the host machine having the highest percentage of the image available in local storage.
 7. A method for provisioning a new virtual machine instance at a host machine, the method comprising: receiving, by a processor of the host machine from a provisioning manager, a request for a computation of a percentage of an image of the new virtual machine instance, wherein the image comprises a plurality of clusters, that is available in data stored locally at the host machine and a request for an indication of whether the host machine has at least one empty slot available, wherein the at least one empty slot comprises a number of cores and a number of memories to be allocated to the new virtual machine instance; sending, by the processor, at least one response to the provisioning manager, the at least one response including the percentage and the indication, the percentage determined by a consultation to a similarity matrix comprising: a plurality of rows, each of the plurality of rows corresponding to a given cluster of the plurality of clusters; a plurality of columns, each of the plurality of columns corresponding to a given image of a plurality of images; and at each intersection of one of the plurality of rows and one of the plurality of columns, an indicator indicating whether the given cluster occurs in the given image; receiving, by the provisioning manager, a second response message from each respective host in a subset of a plurality of host machines, the second response message indicating the percentage of the image of the new virtual machine instance available in local storage on the respective host; receiving, by the processor, an instruction from the provisioning manager to provision the new virtual machine instance when the host machine has a highest percentage of the image available in local storage of the host machine and when the host machine has the at least one empty slot available; constructing, by the processor, at least a portion of the image using the data that is stored locally on the host machine; and completing, by the processor, the image using data obtained over a network from a remote storage.
 8. The method of claim 7, wherein the at least a portion of the image is constructed from those of the plurality of the clusters that are available in the data stored locally.
 9. The method of claim 8, wherein those of the plurality of clusters that are available in the data stored locally are identified using a similarity matrix stored on the host machine.
 10. The method of claim 9, wherein the similarity matrix identifies, for each image of each virtual machine instance running on the target machine, one or more clusters required to produce the each image.
 11. The method of claim 10, wherein the similarity matrix further identifies a file size for each of those of the plurality of clusters that are available in the data stored locally.
 12. The method of claim 11, wherein at least two of those of the plurality of clusters that are available in the data stored locally have different file sizes.
 13. The method of claim 9, wherein the similarity matrix is stored locally on the host machine.
 14. The method of claim 9, wherein the similarity matrix is updated periodically.
 15. The method of claim 9, wherein the similarity matrix is updated on demand.
 16. The method of claim 9, wherein the similarity matrix is updated in response to a change in the data stored locally.
 17. The method of claim 7, wherein the indicator is a zero when the given cluster does not appear in the given image, and the indicator is a one when the given cluster does appear in the given image. 